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ABSTRACT 

The problem of target localization with noise is addressed. The 
target is a sample from a continuous random variable with known 
distribution and the goal is to locate it with minimum mean squared 
error distortion. The localization scheme or policy proceeds by 
queries, or questions, weather or not the target belongs to some sub¬ 
set as it is addressed in the 20-question framework. These subsets 
are not constrained to be intervals and the answers to the queries 
are noisy. While this situation is well studied for adaptive querying, 
this paper is focused on the non adaptive querying policies based 
on dyadic questions. The asymptotic minimum achievable distor¬ 
tion under such policies is derived. Furthermore, a policy named the 
Awr^//af0is exhibited which achieves asymptotically this distortion. 

Index Terms — target localization, 20 questions, dyadic policy, 
non adaptive policies 

1. INTRODUCTION 

Consider the following problem of localizing a one dimensional de¬ 
terministic target X G [0; 1] with a question/answer process. At 
each step, a subset of [0,1] is chosen and is questioned: “whether or 
not X belongs to this subset”. Assume for now that the answers are 
truthful, i.e., there is no noise. By repeating this querying, an esti¬ 
mator of X notated X is derived. The performance of such policy 
is measured by the supremum distance between the target and the 
estimator, sup^ \X — X\. Note that 

( 1 ) 

since at most n bits of information can be learned using n binary 
questions. Assume now that the subsets are intervals and the policy 
is non-adaptive. Then, 

which is achieved by choosing intervals of the form [0, for 

1 < i < n. Among adaptive policies, the lowest achievable error in 
^ is achieved using the dichotomy policy, consisting in splitting in 
two intervals of equal size the current interval containing the target. 
This performance can also be achieved using a non adaptive policy 
with question sets which are not constrained to be intervals. The 
dyadic policy, consisting in querying the bits of X in its expansion 

^The policy is named Aurelian to acknowledge Aurelien Garivier who 
provided the intuition for it. 


in base 2 is an example of a non-adaptive policy which achieves 
optimum performance. 

Consider now the situation when the answers are noisy accord¬ 
ing to a memoryless channel and where X is a sample from a known 
distribution. The performance is then measured using the mean 
square error. In this case, as shown in [?], 

£[(X - X„) 2 ] > ^1 exp(-^2n) ^1,^2 >0 ( 3 ) 

where the expectation is taken over X as well as the noise. Ai and 
A 2 are explicit functions of the entropy of the distribution of X and 
the characteristics of the noise, respectively. Similarly to the noise¬ 
less case, this result is derived by considering the maximum amount 
of uncertainty reduced in average by the question/answering process. 
Now, is this lower bound achievable? 

The bisection policy consists in choosing the interval [0, an] 
where an is the median of the posterior distribution of X after ob¬ 
serving the answer of the n — 1 first questions. It is an adaptive pol¬ 
icy which achieves the bound in albeit with different constants. 
A weaker statement of ^ is that the bisection policy achieves 

lira - \yiE[{X - Xnf\ = -C (4) 

n^oo n 

for some C > 0. A natural question consists in asking if there ex¬ 
ists a non adaptive policy achieving 0 while allowing for querying 
arbitrary sets, that is, which are not necessarily intervals. The an¬ 
swer to this question is unknown up to our knowledge. In order to 
progress in answering it, we consider a family of non-adaptive poli¬ 
cies. We start with the dyadic policy. Indeed this policy is optimal 
in the noiseless case. We also know that the dyadic policy is opti¬ 
mal for a different loss function which is the differential entropy of 
the posterior distribution, see [?] and that the differential entropy of 
the posterior need to be small for the mean square loss to be small. 
However, the reciprocal is not true. We extend the dyadic policy 
as follows: In the case where the prior distribution is Uniform, the 
dyadic questions correspond to the coefficients of X in its expansion 
in base 2. We allow each bit to be queried not only once but an ar¬ 
bitrary number of times under the constraint of a fixed total number 
of questions. This construction is extended to non Uniform priors 
by first transforming X into a Uniform random variable using the 
transformation X F(X) where F is the cumulative distribution 
of X and then considering the dyadic questions for F{X). 

2. INFORMATION TRANSMISSION EQUIVALENT 
PROBLEM 

The problem of interest can be mapped to the well-know point to 
point communication system [?] which transmits the location of the 





target through a memoryless channel. Consider the transmission of 
a message x in the interval [0,1] through a binary input, arbitrary 
output memoryless channel. The transmission aims at minimizing 
the mean square error between the input message and the decoded 
message. The analysis is presented in the case where the source is 
Uniformly distributed. This analysis is then extended to other dis¬ 
tributions in Section It is assumed that the source provides the 
binary representation of the message x\ 

oo 

x = ^Xk2-’^ ( 5 ) 

k = l 

Since the transmission of the infinite bit sequence xi, X 2 ,is not 
feasible in practice, the message should be quantized by selecting a 
finite number of bits, notate by 1. 

The encoding, shown in Figure consists in removing the 
source redundant information in order to gain compression effi¬ 
ciency. This is achieved by a quantization scheme. For the uniform 
source class, the scalar quantizer where the quantized levels are ex¬ 
plicitly represented via a linear combination of the x^’s has been 
proved to be optimal [?, ?] for the mean square error distortion. A 
source encoder with fixed rate I is assumed in this paper. The output 
of the source encoder is the first I bits of the infinite binary represen¬ 
tation of X. This quantized version is denoted as Xi in Figure[]J The 
truncated message is denoted xi = Ylk=i 

The channel encoder is designed to add redundancy to the source 
bit stream to protect the source information from the channel noise. 
The bits in the binary representation are of different importance due 
to the different weights in the sum|^ This motivates the coding 
policy which transmits each bit at a different rate. [?] proposed to 
transmit each I bits through a single binary symmetric channel with 
different transmission rate. This paper considers the general trans¬ 
mission scheme for which each bit is allowed to be transmitted a 
different number of times while the total number of bits to be trans¬ 
mitted is fixed. In Figure the channel encoder sends the n bits 
codeword Cn = {xi^, Xi^,Xi^) for which I < ij < 1. We 
denote q — maxj ij to be the maximum bit index selected for trans¬ 
mission. Note that q <1. For any bit k, we define 

n 

tn,k — ^ ^ 
i=i 

to be the number of times that bit k is transmitted in the codeword 
of length n. A transmission policy is then denoted by a vector tn = 
tn, 2 , ...) such that tn,k > 0 for my k > 1 and tn,k = 

n. 

The received message Yn = {yi ^, ^^ 2 , yi ^) is different from 
the transmitted message due to the channel noise. Here, we use a bi¬ 
nary input channel with the following channel posterior probability: 

PiY = y\X = x) = [^l^y\ * = ^ (7) 

where /o and fi are point mass functions or densities. 

The decoder is designed to minimize the end-to-end mean 
square error (MSB) between the decoded message Xn and the orig¬ 
inal message X, notated as E[{Xri — 

The communication system of interest is shown in Figure 
Here, the source encoder and the channel encoder are two sepa¬ 
rate entities, but there is a tight connection between them. This 
type of joint source-channel coding has been adressed in [?]. This 



Fig. 1. The general transmission interface of a continous message. 


paper considers only non-adaptive transmission policies. The non- 
adaptive policies are the policies that uses a one way communication 
between sender and receiver. On the other hand the adaptive poli¬ 
cies are corresponding to the system for which there is a noiseless 
feedback channel which provides sender the results of the previous 
transmission [?,?]. 

3. END-TO-END DISTORTION 

After transmission of n bits, the decoded message Xn is different 
from the source message X due to the quantization and transmission 
distortion. The MSB distortion at step n is defined as: 

E[{Xn - xf] = E[E[{Xn - Xf\Bn\] (8) 

here, E[.] denotes statistical expectation and Bn is the history of the 
transmissions of the previous bits. 

The square error distortion of Bq is minimized when Xn = 
E[X\Bn] which lead to the minimum end-to-end distortion: 

Dn = E[{E[X\Br,] - Xf\Bn] 

= S[var(X|B„)] (9) 

Bor a given history Bn the transmitted bits Xi and Xj , i,j < n 
and i j are independent and the minimum end-to-end distortion is 
therefore, using ([^, simplified as: 

Dn = y]2-"'=S[var(Xfe|Bn)] (10) 

k>l 

where Xk is the random variable corresponding to the k’ih bit of X 
in the binary representation of Bq[^ Since each bit k in the above 
sum is weighted differently, an unequal bit error protection trans¬ 
mission pattern is desired. The variable tn,k denotes the number of 
times that bit k has been transmitted within the first n transmissions. 

We now provide bounds on the distortion achieved by any such 
transmission scheme or equivalently on any non adaptive policy us¬ 
ing the dyadic questions. The proof involves classical large deviation 
inequalities and is provided in the Appendix. 

Theorem 1 For any transmission pattern tn, the minimum end-to- 
end distortion Dn is bounded as follows: 

L{tn) <Dn< U{tn) (11) 

where, 

k>l 

U{t„) = 2~^'°exp(-7n,fcC'(/i, /o)) 

fe>i 


( 12 ) 














Fig. 2. The distortion is bounded from above and below for all trans¬ 
mission policy in for n = 10 and the maximum depth of 3. 


C{fi, fo) is the chernoff information bound [1 ] and 


B{h,fo) = E 


exp 


infmi) 


(13) 


• there is no gap between bit indices selected for transmission, 
i.e. tn,k > 1 for 1 < k < q, where q is the last non-zero 
transmission bit index. 

• for any , /c 2 > 1, the difference between transmission val¬ 
ues of the corresponding bits is bounded from above and be¬ 
low: 

(fe - fci)r - 1 < tn,ki - tn,k2 < (fo - ki)r + 1 (14) 

where r = /n(4)/C(/i,/o). 


These properties are essential to derive the following lower 
bound on the minimum end-to-end distortion for all efficient poli¬ 
cies. 


Theorem 2 The logarithm of the minimum end-to-end distortion of 
any efficient non-adaptive transmission policy goes to —oo with a 
rate in 0{—^fn), more specifically: 


— Ai < lim 


\n{Dn{tn)) 

Vn 


for each sequence {tn}n>i of efficient policies, where 




— min(A/2( 


ln(4) 

C(/i,/o) 


+ l)S(/i,/o),ln(4)) 


Figure provides an illustration of theorem We show the 
distortion as well as the lower and the upper bound forn = 10 ques¬ 
tions and I = 3 bits source. The channel is binary with transitions 
probabilities po^o = 0.9,pi^i = 0.8. For this setting, C = 0.77 
and B = 2.08. There are only 66 possible transmissions patterns 
for which the distortion and the bounds are plotted in the figure. The 
experiments are done over 100 iterations. These policies are shown 
in the x axis. The policy t which minimizes i9io(t) is (6, 3,1) con¬ 
sisting in sending the first bit 6 times, the second, 3 times and the 
third once. 

4. AURELIAN CODING SCHEME 

The optimal non-adaptive policies are the ones which minimize the 
end-to -end distortion Dn. However, it is difficult to find such opti¬ 
mum transmission patterns from Eq[^ Instead, we define the effi¬ 
cient policies as follows: 

Definition 1 An efficient policy is a policy tn = (tn,i, tn, 2 ,...) that 
minimizes the upper bound U of the end-to-end distortion. 


argmin 

tn 

^''exp(-i„,feC'(/i,/o)) 

k>l 

subject to 

00 

^ ^ tn,k — til., tn^k G IM U {0} 
k=l 


As Figure shows, the upper distortion bound can have many 
local minima. Solving such an integer minimization problem might 
be hard directly. However, we can characterize some properties of 
an efficient transmission policy. 

Lemma 1 For any efficient transmission policy tn, 


Even though we derived some necessary important properties 
for policies that achieve the infimum of the upper bound, we did not 
provide an explicit efficient non-adaptive transmission policy. Our 
strategy is the following: We propose a policy, called the Aurelian 
policy, derived from the minimization of © for sequences of real 
numbers tn (Of course we then adapt the t’s such that we obtain a 
sequence of integers). 

Definition 2 Aurelian Policy : 



tn,i = qr 

tn,k = tn,l-{k-l)r 

r = Lln(4)/C(/i,/o)J (15) 

At this stage, even if the choice of the Aurelian policy is based 
on some ’’good” intuitive reasons (the minimization of the contin¬ 
uous problem), we do not have any insurance on the rate of conver¬ 
gence to 0 of the distortion rate when this policy is used, compared 
to the rate of convergence of an efficient non-adaptive transmission 
policy. Theorem 2 address this this problem. Eirst of all, we will de¬ 
rive a upper bound for the convergence rate to — oc of the distortion 
rate of the Aurelian policy. Having this rate, we will show that it is 
comparable to the distortion rate of an efficient non-adaptive trans¬ 
mission policy. In other words, we can conclude that we don’t loose 
’’too much” by following an Aurelian policy instead of deriving an 
explicit non-adaptive transmission policy. 

Theorem 3 The logarithmic rate of convergence of the Aurelian 
policy is no more than 0{—^/n). More precisely, 

lim sup < -A 2 (16) 

n^oo V n 

where A 2 = V^C{fi, fo). 















7. APPENDIX 



Fig. 3. The normalized distortion versus the number of transmis¬ 
sions as well as the lower bound, i.e the constant —Ai provided in 
Theorem 2. 

And finally, 


7.1. Proof of Theorem 1: 

7.J.J. Upper bound 

Proof: We first prove the follwoing inequality: 

E[var{Xj,\Br,)] < exp(-t^,feC(/i,/o)) ( 19 ) 

the upper bound is then derived by considering the fact that for any 
random variable A and scalar b, if A < b, then E[A] < b. Note: 

Var[Xk\Bn] = P(Xk = - P{Xk = l\Bn)) 

< min{P{Xk = 0\Bn),P{Xk = l\Bn)) 

= P{Xk^Xk\Bn) 

where Xk is the mode of p{Xk\Bn). The above equation can be 
simplified further as: 

P{Xk / Xk\Bn) = P{Xk = l\Bn,Xk = 0)PiXk = 0) 

+ P(Xk = 0\Bn,Xk = l)P(Xk = 1) 


Corollary 1 If we denote by Dn the distortion rate of the Aure- 
lian policy, and the distortion rate of any efficient non-adaptive 
transmission policy, then we have: 


, \n{Dn 


lim inf ■ , _ 
n^oG \n(D* 


“ A2 


(17) 


for Ai and A 2 given in Theorem^and Theorem^ respectively. 


The blue curve in Figurej^shows the asymptotic behavior of the 
aurelian policy. As this figure shows, after about 300 transmissions, 
the receiver is able to decode the message with very high confidence. 
It is also worth to mention that the rate of convergence is of order 
1/x/^. 


5. NON-UNIFORM DISTRIBUTION 

Note that the lower bound of Theoremj^generalizes to random vari¬ 
ables for which the cumulative distribution is Lipschitz continous. 
Indeed in this case, by definition, there exists a constant k > such 
that 


where. 


P{Xk = \\Bn,Xk=0) = P(y]lfcfe)log(f (j/i,) >0)) 


i=i 


/o 


n „ 

= P(exp(sy] Ifc(ij) logifivij))) > 1) 


3=1 


< 


P [exp(s Ife (ij) log( )))] 

i=i 

exp(-i„,feC'(/i,/o)) 


here, Ifc(^j) is the indicator of sending bit k at transmission step 
ij. The second equality is hold for any positive value s. The third 
inequality is derived using Markov inequality and the final equality 
holds based on the fact that by defnition. The 

fact that each transmission is independent of the rest and have same 
i.i.d distribution is also used for deriving the expectation of the last 
inequality. 


{E{ui) - E{u 2))^ < k{ui - U2)‘^ 


(18) 7.7.2. Lower bound 


The cumulative disttribution of any random varaible is a uniform 
random variable in interval [0,1]. Instead of locating target X, we 
can search for the target P{X). Lets Pn be the estimated location 
after n steps and let Xn = 7^~^(7^n)- 

(Fn - F{X)f < k{Xn - xf 

The lower bound of the distortion is then derived using Theoremj^ 


Next, Lets define the posterior function pn,/c = P{Xk = l|5n). 

Lemma 2 The posterior denisty on bit k at step n is pn,k — Pn-i,k 
whenever lk{in) = 0. Otherwise the posterior is a function of the 
posterior at previous transmission step: 


Pn,k 


_ fl{yijpn-l,k _ 

fl{yin)Pn-l,k + fo{yin){f ~ Pn-l,k) 


( 20 ) 


6. CONCLUSION 

The problem of noisy target localization under mean square distor¬ 
tion is addressed through an iterative binary question/answering pro¬ 
cess. Among non-adaptive policies, the policies involving dyadic 
questions were considered. It was shown that the logarithm of the 
mean square distortion is 0{—^/n) for n large enough. An explicit 
policy achieving this rate was exhibited. We finally conjecture that 
this policy is optimal among all non-adaptive policies. 


Prrof: First note that since bits are independent random variables, 
sending a bit different from k will not change the posterior at step n, 
thus pn,k — Pn-i,k when in k. In the other hand, whenever the 
bit k was sent through the channel at the n’th transmission: 

Pn,k = P[Xk = l\Bn) 

PjyiJXk = l,Bn-l)P{Xk = l\Bn-l) 

P{yi^\Bn-i) 











where, 




nG{0,l} 

= P(yi^\Xk = 1, Bn-l)P{Xk = l\Bn-l) 

+ PiVir.\Xk = 0 , Bn-l)P{Xk = 0\Bn-l) 

= fliyin)Pn-l,k + fo{yiJ{l -Pn-l,k) (21) 


we claim < U{tn): 


U{Q-U{tn) = 


< 


X 2 ^^(exp(4^fcC7) - exp(tn,fcC')) 
k>i 

— 1) + 2“^^^(1 — 

0 


Lemma 3 If in = k 


Pn,k{^ - Pn,k) > Pn-lA'i^ - P„-l,k)e ^k(in)\los( 


Proof: For in — k. 


Pn,A:(l Pn,k^ — 


> 


fliyin)MyiJPn-l,kil -Pn-l,k) 
[f'i^{yin)Pn — l,k P fo{yin){^ Pn —l,fc)]^ 

fl{yin)fo{yin)Pn-l,k{'^ -Pn-l,k) 


fli^yi-n^^ ^-1 P 

/o(?/in)- fiiviP- 

Pn —l,fc(l Pn —l,fc) 


foiVin) ^ 
flPin) 


/o(yi„)- 


+ 


/l(^in) 

foiVin) 


1 


foiVin) 

fliVir,) 


>1 


Pn-l,fc(l - Pn-l,k) exp(-| log 


h 

fo 


iViJl) 


Corollary 2 After transmission of n bits, 


Since 0 < C < 1 and ki < /c 2 .n 

Let us call the transmission policy derivable from tn by a 
(/ci, /c 2 )-move when 

{ tk,n + 1 if k = ki 

tk,n 1 if k — /C2 
tk,n Otherwise 

here it is assumed that tn,k 2 ^ otherwise the above move can not 
be defined. An optimal transmission policy is such that no further 
(/ci, A: 2 )-move can lead to a transmission policy with lower upper 
bound distortion. This is the key idea to prove the following lemma: 
Property 2: Lets tn be a coding policy and t'n be the policy de¬ 
rived by a (/ci, /c 2 )-niove between two bit indices ki and k 2 such 
that tn,k 2 ^ 1- This move reduce the upper bound values if 
U{tn) > U(tn), simplification of both side prove the following 
lower bound inequality 

(^2 k\)T 1 ^ tn,ki tn,k2 

Similarly, the other bound can be derived by considering situations 
that a (/c 2 , /ci)-move can reduce the upper bound. 


Pn,k(f - Pn,k) > P0,k{f - P0,k) ex.p{-tn,kB(fi,fo)) (23) 
where, B(/i,/o) = E[\ log(|^)|]. 


Corollary 3 Using the above properties, we can derive the follow¬ 
ing bounds for the last transmitted bit index q and the transmission 
number of first bit, tn,i: 


Proof: Using Lemmaj^recursively: 


Pn,k{f Pn,k^ P 


n 

Po,k{f -po,k) ]^exp(-lfe(ij)|log( 
i=i 




tn,i < q{r + l) 

q < i/2n+ 1/2 - 1/4 


n 

= po,fc(l -po,fc)exp(- lfc(lj)| log( 
1=1 


hiVij) 

foivij) 


The proof is very straigtforward using the properties and note 

= l ^n,k = 'kl. 


Taking expectation from both side and using Jensen inequality: 

E\vaT(^Xk\Bn^] — E\pn,k(^f Pn,k^] 


> po,fc(l - Po,k) ex.p{-tn,kE[\ log( 


My)_, 

My)' 


where the last equation derived based on the fact that there is only 
tn,k Steps that bit k has been sent and these steps are i.i.d. 

7.2. Optimal Properties 

Property 1: Lets assume there is a bit ki such that tn,ki = 0 and 
let k 2 be the first index greater ki such that tn,k 2 > 0- Lets t'n be a 


new policy made from the earlier 

policy tn, such that 

1 tn,k2 

k = ki 

tn,k = S 0 

k — k2 

\tn,k 

k ^ {ki,k2} 


7.3. Asymptotic Behavior of Infimum of the Efficient Policies 

Lets tn be an efficient policy. For such policy 

L{tn) = ^ [X 2~^^ exp(-tn,fe^)] 


k>l 

Q 


7 exp( '(-fc log(4) - tn,kB)) + 2 


k>q 


j[e''^^ex.p{vk-vi)p2 


> 


4 

j[«” 


k=i 


-a + 3^ 


-qlog( 4 )i 


(25) 


where a = /o^(4)(l P §) > log{4:), since B,C > 0. Using the 
Corollary!^ 


iii>log(4)-9(‘^ + l)B 


(26) 


















let n ^ oo and using the upper bound of q. 


lim inf ( 27 ) 

n^ootnET* y'n 

where Ai = min(v^(^^^^ + l)B,log(4)) which complete the 
proof. 

7.4. Asymptotic Behavior of Supremum of the Aurelian Policies 

For an Aurelian policy tn : Lets tn be an efficient policy. For such 
policy 

U(tn) = y^2~^^exp(-tn,fcC) 
k>l 
Q 

= exp((-fc log(4) - tn,kC)) + 

k=l k>q 

= y^exp(-log(4)+ 

k = l 

= 9 exp(- log(4) - qrC) + ^ exp(-7log(4))(28) 
For n ^ oo, since rC < log(4): 

lim sup < _^2 (29) 

where A 2 = ^/{2r)C. 



